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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 3 891 base oairs 

(B) TYPE: nucleic acid" 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 151.. 1425 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 ; 
CTCTTTGCCT CCTCCCTGTT CCAGGAGCTG GTGCCCTGGG CTCTGCGCTG TTGTTTTCAG 
CGTTCCGAAA GCCQGOGCTT GAGATCCAGG CAAGTGAATC CAGCCAGGCA GTTTTCCCTT 
CAGCACCTCG GACAGAACAC GCAGTAAAAA ATG GCT CCG ATC ACC ACC AGC CGG 

Met Ala Pro He Thr Thr Ser Arg 
25 5 
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280 



A^n f 36 ^ o CT TCA GGA CGG 007 GAG TTA CTG ATC TCT CTC TGC 

Asn val Arg Lys Ser Ser Gly Arg Gly Glu Leu Leu lie Jer Leu ?ys 

285 290 29S 



35 



32S 



45 



SI? ^ ^ ^ GC ACC S CC GCA GTG TTC AAT GAG CTG TTT GTC TTT 

Val Lys Lys Cys Thr Pro Asn Ala Val Phe Asn Glu Leu Phe Val Phi 

3S0 355 ago 

GAT ATT CCT TGT GAG GGC CTT GAA GAT ATA AGT GTT GAA TTT TTG GTT 
Asp lie Pro cys Glu Gly Leu Glu Asp He Ser Sal GlJ J£ S2 Sal 
- J65 370 375 

TTG GAT TCT GAA AGG GGG TCC CGA AAT GAG GTA ATC GGG CAG TTA GTC 
Leu Asp Ser Glu Arg Gly Ser Arg Asn Glu Val lie S? cTn llu vll 



400 
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AAATTTTCTT TCTTTCTATA TTGGATTGCA AGCTTGGGAA ATCAAGCTAC CTTTTTGTTG 
TTGTTGTTGT TGCTAGAAAT GGATTGAATT AGTAGACCAG AAAGTAACTT CAAATGTGTA 



894 



220 22S 23Q 

t7- r^ ^ J™ G ? C TTG CAC TTC ACA ATT TTG AGT TTT GAC AGG TTT 
He Gin Glu Leu Ala Leu His Phe Thr He Leu Ser p£ Sp J£ 

240 245 

TCA AGA GAT GAT ATC ATT GGG GAA GTT CTA ATT CCT CTC rrr rva , m 

Ser Arg Asp Asp He He Gly Glu Val Le2 ill Pro 25 J5r nl 942 

20 255 260 

GAA TTA TCT GAA GGA AAA ATG TTA ATG AAT AGA GAG ATC ATC AAr 
Glu Leu Ser Glu Gly Lys Met Leu Met Asn Arg £2 n5 n5 £2 jgj 990 



1038 



?n £ AT S AG TCC ACC ACA ACT CTA ACT GTG GTT GTC TTA AAA GCT rr» 

30 Tyr Gin Ser Thr Thr Asn Thr Leu Thr Val Val Sal 2J aS £5 1086 

0 305 310 



CAT CTG CCT AAA TCT GAT GTG TCC GGA CTT TCA GAT rrr tat rrn »»» 
Hxs Leu Pro Lys Ser Asp Val Ser Gly Leu IS Sp Pro SI? " 34 



SI? 25 S 35 35 S ^ JJS JS !S S £5 SS « S K 1182 

40 335 340 



1230 



1278 



50 £2 2» is a; ss s? s s ss a; ss s s? ss ™ gtc 1326 

380 385 390 

TTG GGT GCA GCA GCA GAA GGA ACT GGT GGA GAG CAC TGG AAA GAG ATC 
55 Leu Gly Ala Ala Ala Glu Gly Thr Gly Gly Glu 25 ^ gJS h5 



1374 



1422 



r?Z n AC ^ AC S CC AGG AGA CAA ATT GCC AAG TGG CAC GTG CTC TGT GAT 
Cys Asp Tyr Pro Arg Arg Gin He Ala Lys Trp His SI? LeS lyl £J 

60 415 420 

GGT TAGCATCCTA GC CGTGAGTT GGAACTTAAA GGTTTTTACT AGGCAAGGAG 1 475 
425 



1535 
1595 
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TTATGATAAT TTCCCTATTT ATTAGAAGAG TTGGATAAAT TTTCATAAGA TATTCAATAT 1655 

CTCCTTCAGA TTACCAGTGA TATAACTAGG AATAGTCAGA CATTTTATGA ATACTGTGCC 173.5 

5 AGAATCCCAA ATTATAAATG TGACAATCTC ATTGGAACAT GTCACAAAAA GTTAATGTGA 17 75 

TTAAGATTTA AAAACGAAAA GTATGCCTTG CCTTGTGAAA ATTTATC CAT TTATCTTCAG 1835 

iQ GTTGGGGGAA ATCAATTTTT CTTTAATCCA AAGATACTAA AAAAATGTCC TCCAGTTTGT 1895 

ATTTATTAAT TCTGTCATGT GCAAATGGTT GTCCTGCATA TAAAAGTATC TGGTCATTTC 1955 

AGTTTGGTTT GTAATTATTT GATGCAATTT TATCATAAGA GTAACTCAGA TTCATTTCAA 2015 

15 AAGGACAGTG AACAAGCTGA GAAATTATTT TATCAAAGGG CTGAGTTGAG AACACTGTGG 2075 

CTGAAATATA ATTTTTCTCC CCCCTAAGGT TACATGTGAG TCAAAATTTT GTAAAATATA 2135 

2q ACCTCACATA AGAACCATGG CCTTGGATTA TTCACTGCCT GTCACAAGCC TCAGTGTGGC 2195 

CTGAGAAATC CCTATGTACC TTTGTGAAAT TGTTGAATTA GTTAGTGAAT AAAGAAATAA 2255 

ACTTCAACTA GAAATCCAGT TAGAAGTGCA ATTTTCTTAT AGGAAATAGG TATAGTGTGC 2315 

25 AAGTGTACTT TTAAGGCCAT CGTTTGTACC CAGAGTCGGC ATGGCCACCT AAGTCTTCAT 2375 

TTAATTTATT GTCCCCCAGA AAAGATTAAG ATGCTACTTG AAAAGACTGT GAAGATTTTT 2435 

3Q TACATTGCCA GATAAAAAGT GTTACTTAAC CAACAAACAA ATGTAAGACT ACAAAATCGT 2495 

TCAAGAGCAA TTCTAATATA ATTTACATAT GTTCACGCAA AATATGCTTA GGCTGTCAAA 2555 

TTAGCACAAC AAAGAATGTG TTTCACTATC TTTTCTAGGC TAATTTGTCT TGAGCTGTTG 2615 

35 TCTATAGAGC AGTTTACAGA CTTGTGTCTT GTATCATTTT CCAGTGCCAG GGTTCTGAAA 2675 

TTCATTCAGA ACCTGTTAGA TTAAAG CTGC ACCCTGTGAT TATTTGAAAA GAATTAGCTT 2735 

4Q GAGAGTAATG TCACTATATT TGAGTTCTTA GAGAAGTATG AGTGGAACTT GAGTACAGTT 2795 

GAATTATTAA ATATGCAAGT TAGAAATTAA GTCTACTGAA AAATTTACAT TTTGAGTCAG 2855 

GTTTTGTGTC AGTACTTTAG CAGTTTTTGA GAATGTGTTT GATATCACAG TGTTTGTAAA 2915 

45 TTCTATGAAA AATGCATTTT CCAAACAACT TATACATGCT TTTTATGACT ATGCCTAATG 2975 

TAAAGAAAAT GTATTACATT CTGTATGTAC AAAGATTAAA AATCAACCTC TTTTTTGTGC 3035 

5Q TTTAAAATGA CTTTGGGATT AAAAAAGCAT ATTTCCCAAT CATTGTCTTC ATTCCACTAC 3095 

AAAGTCACCT CACAGCATCT TGCTCCACTC GGCATCTCTG TGAAAGCAAC ATGAAATGAA 3155 

CTGTAGTAGG TGTGTAGTTT GGGGAAGTCA AATGGCCATT TTATGTATGT GCATTTGGTA 3215 

55 TCATGGGCCG TGGAACAGAA TATATGTTGG ACCTCTGAAA AGTTGTAAGG GGCCAAATCT 3275 

AAGTATTCTT CACGGCAGCC AGAAGTTAAT GGTGGTAGCA GCTGAGGTAT GGTTGTTGGA 3335 

CGAGGCCGAT TTTTTTTTTT TAACATGGAA CAATGAAACC AACAACAAAC ATTTTTAAAA 3395 

TTAAAATGGA TAATTTGTAA ATAGTTTTTA GCTTTTAAAA TTTAAAGTGT TTTTGAGTGT 3455 

GAAAAGTTGA GTAAAACTAT TTGCAACTGG TTTT CAGAAA AGAGAAAAGA AACAACAAAG 3515 

65 GAATTGAAAC AGGCAGGGAG ATCTTAATAC CTAATTTCAT CATTTCTGCA AAATGTACTG 3 575 

TTTTAGAATG TATTACAATA TCAATGTGAA TATCTTGAAT CCTGTTACAA ATCCTGCACT 3 63 5 



GTATTAAACA TGTAAATTAA TTGTTTGTCT 
GGTATACATG TTTGAAGAAC GTGTAACTCG 
ATAGAAGTGT TTTGGAAGGA AGCATGGTGT 
TCTGTATGAT GTTTGTAAGA CCATGTTTGT 
AAAAAAAAAA AAAAAA 



67 

GATTAGCCAA TCTCACCACC CAAATGGGGA 
GTAATTGATT TGTTCTGATG TTGTAACTCA 
GTGAGACAGT GTCTGTTCTT TTGTGCCAGC 
AAGACATGAA TAAATTGCTG CTTTTGCCCA 
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!2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42S amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Ala Pro He Thr Thr Ser Arg Glu Glu Phe Asp Glu He Pro Thr 
1 5 10 15 

15 Val Val Gly He Phe Ser Ala Phe Gly Leu Val Phe Thr Val Ser Leu 
20 25 30 

Phe Ala Trp He Cys Cys Gin Arg Lys Ser Ser Lys Ser Asn Lys Thr 

20 40 4S 

Pro Pro Tyr Lys Phe Val His Val Leu Lys Gly Val Asp He Tyr Pro 
50 55 60 

Glu Asn Leu Asn Ser Lys Lys Lys Phe Gly Ala Asp Asp Lys Asn Glu 
^ J b -> 70 75 



80 



Val Lys Asn Lys Pro Ala Val Pro Lys Asn Ser Leu His Leu Asp Leu 
85 90 95 

30 Glu Lys Arg Asp Leu Asn Gly Asn Phe Pro Lys Thr Asn Leu Lys Pro 
100 105 110 

Gly Ser Pro Ser Asp Leu Glu Asn Ala Thr Pro Lys Leu Phe Leu Glu 
35 115 120 1 25 

Gly Glu Lys Glu Ser Val Ser Pro Glu Ser Leu Lys Ser Ser Thr Ser 
130 135 140 

40 ,1* Thr SSr ° 1U Glu Lys Gln Glu Lys Leu G1 y Leu Phe Phe Ser 
^ 150 155 



160 



Leu Glu Tyr Asn Phe Glu Arg Lys Ala Phe Val Val Asn He Lys Glu 
165 170 175 

45 Ala Arg Gly Leu Pro Ala Met Asp Glu Gin Ser Met Thr Ser Asp Pro 
180 185 190 

Tyr He Lys Met Thr He Leu Pro Glu Lys Lys His Lys Val Lys Thr 
50 195 200 205 

Arg Val Leu Arg Lys Thr Leu Asp Pro Ala Phe Asp Glu Thr Phe Thr 
210 215 220 

« Tyr G1 Y Ile Pr ° Tyr Thr Gin He Gin Glu Leu Ala Leu His Phe 

33 225 230 235 240 

Thr He Leu Ser Phe Asp Arg Phe Ser Arg Asp Asp He He Gly Glu 
245 250 255 

60 Val Leu Ile Pro Leu Ser Gly Ile Glu Leu Ser Glu Gly Lys Met Leu 
260 265 270 

Met Asn Arg Glu He Ile Lys Arg Asn Val Arg Lys Ser Ser Gly Arci 

65 275 280 285 

GlY LeU LeU Ile Ser Leu C V S Gln Ser Th * Thr Asn Thr Leu 

290 295 300 
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Thr Val Val Val Leu Lys Ala Arg His Leu Pro Lys Ser Asd Val Ser 
J ° 5 310 315 " 320 

Gly Leu Ser Asp Pro Tyr Val Lys Val Asn Leu Tvr His Ala Lvs Lvs 
32S 330 " 335 * 

Arg lie Ser Lys Lys Lys Thr His Val Lys Lys Cys Thr Pro Asn Ala 
J *° 345 350 

Val Phe Asn Glu Leu Phe Val Phe Asp He Pro Cys Glu Gly Leu Glu 
355 360 3g 5 

Asp lie Ser Val Glu Phe Leu Val Leu Asp Ser Glu Arg Gly Ser Ara 
J70 3 75 3 80 

Asn Glu Val He Gly Gin Leu Val Leu Gly Ala Ala Ala Glu Gly Thr 

J 85 390 -> ^ - 1 



395 



400 



Gly Gly Glu His Trp Lys Glu He Cys Asp Tyr Pro Arg Arg Gin He 
405 410 415 

Ala Lys Trp His Val Leu Cys Asp Gly 
420 425 



